Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 506 |
| Missing cells | 120 |
| Missing cells (%) | 1.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 55.5 KiB |
| Average record size in memory | 112.3 B |
Variable types
| NUM | 13 |
|---|---|
| BOOL | 1 |
TAX is highly correlated with RAD | High correlation |
RAD is highly correlated with TAX | High correlation |
CRIM has 20 (4.0%) missing values | Missing |
ZN has 20 (4.0%) missing values | Missing |
INDUS has 20 (4.0%) missing values | Missing |
CHAS has 20 (4.0%) missing values | Missing |
AGE has 20 (4.0%) missing values | Missing |
LSTAT has 20 (4.0%) missing values | Missing |
ZN has 360 (71.1%) zeros | Zeros |
Reproduction
| Analysis started | 2023-01-08 06:21:46.810638 |
|---|---|
| Analysis finished | 2023-01-08 06:22:58.801596 |
| Duration | 1 minute and 11.99 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 484 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 20 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.611873971 |
|---|---|
| Minimum | 0.00632 |
| Maximum | 88.9762 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 0.00632 |
|---|---|
| 5-th percentile | 0.02739 |
| Q1 | 0.0819 |
| median | 0.253715 |
| Q3 | 3.5602625 |
| 95-th percentile | 15.870875 |
| Maximum | 88.9762 |
| Range | 88.96988 |
| Interquartile range (IQR) | 3.4783625 |
Descriptive statistics
| Standard deviation | 8.72019185 |
|---|---|
| Coefficient of variation (CV) | 2.414312326 |
| Kurtosis | 36.56834838 |
| Mean | 3.611873971 |
| Median Absolute Deviation (MAD) | 0.218875 |
| Skewness | 5.21284265 |
| Sum | 1755.37075 |
| Variance | 76.0417459 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0.01501 | 2 | 0.4% | |
| 14.3337 | 2 | 0.4% | |
| 0.04544 | 1 | 0.2% | |
| 0.02498 | 1 | 0.2% | |
| 0.01301 | 1 | 0.2% | |
| 0.06151 | 1 | 0.2% | |
| 0.05497 | 1 | 0.2% | |
| 0.03306 | 1 | 0.2% | |
| 0.03041 | 1 | 0.2% | |
| 0.03427 | 1 | 0.2% | |
| Other values (474) | 474 | 93.7% | |
| (Missing) | 20 | 4.0% |
| Value | Count | Frequency (%) | |
| 0.00632 | 1 | 0.2% | |
| 0.00906 | 1 | 0.2% | |
| 0.01096 | 1 | 0.2% | |
| 0.01301 | 1 | 0.2% | |
| 0.01311 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 88.9762 | 1 | 0.2% | |
| 73.5341 | 1 | 0.2% | |
| 67.9208 | 1 | 0.2% | |
| 51.1358 | 1 | 0.2% | |
| 45.7461 | 1 | 0.2% |
| Distinct | 26 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 20 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.21193416 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 360 |
| Zeros (%) | 71.1% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 12.5 |
| 95-th percentile | 80 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 12.5 |
Descriptive statistics
| Standard deviation | 23.38887615 |
|---|---|
| Coefficient of variation (CV) | 2.086069702 |
| Kurtosis | 4.132614189 |
| Mean | 11.21193416 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.256612605 |
| Sum | 5449 |
| Variance | 547.0395274 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=26)
| Value | Count | Frequency (%) | |
| 0 | 360 | 71.1% | |
| 20 | 20 | 4.0% | |
| 80 | 14 | 2.8% | |
| 22 | 10 | 2.0% | |
| 25 | 10 | 2.0% | |
| 12.5 | 10 | 2.0% | |
| 40 | 6 | 1.2% | |
| 45 | 6 | 1.2% | |
| 90 | 5 | 1.0% | |
| 30 | 5 | 1.0% | |
| Other values (16) | 40 | 7.9% | |
| (Missing) | 20 | 4.0% |
| Value | Count | Frequency (%) | |
| 0 | 360 | 71.1% | |
| 12.5 | 10 | 2.0% | |
| 17.5 | 1 | 0.2% | |
| 18 | 1 | 0.2% | |
| 20 | 20 | 4.0% |
| Value | Count | Frequency (%) | |
| 100 | 1 | 0.2% | |
| 95 | 4 | 0.8% | |
| 90 | 5 | 1.0% | |
| 85 | 2 | 0.4% | |
| 82.5 | 2 | 0.4% |
| Distinct | 76 |
|---|---|
| Distinct (%) | 15.6% |
| Missing | 20 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.08399177 |
|---|---|
| Minimum | 0.46 |
| Maximum | 27.74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 0.46 |
|---|---|
| 5-th percentile | 2.18 |
| Q1 | 5.19 |
| median | 9.69 |
| Q3 | 18.1 |
| 95-th percentile | 21.3125 |
| Maximum | 27.74 |
| Range | 27.28 |
| Interquartile range (IQR) | 12.91 |
Descriptive statistics
| Standard deviation | 6.835896499 |
|---|---|
| Coefficient of variation (CV) | 0.6167359775 |
| Kurtosis | -1.217990915 |
| Mean | 11.08399177 |
| Median Absolute Deviation (MAD) | 6.32 |
| Skewness | 0.3037221876 |
| Sum | 5386.82 |
| Variance | 46.72948094 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 18.1 | 127 | 25.1% | |
| 19.58 | 28 | 5.5% | |
| 8.14 | 22 | 4.3% | |
| 6.2 | 18 | 3.6% | |
| 21.89 | 14 | 2.8% | |
| 9.9 | 12 | 2.4% | |
| 3.97 | 12 | 2.4% | |
| 8.56 | 11 | 2.2% | |
| 10.59 | 11 | 2.2% | |
| 5.86 | 9 | 1.8% | |
| Other values (66) | 222 | 43.9% | |
| (Missing) | 20 | 4.0% |
| Value | Count | Frequency (%) | |
| 0.46 | 1 | 0.2% | |
| 0.74 | 1 | 0.2% | |
| 1.21 | 1 | 0.2% | |
| 1.22 | 1 | 0.2% | |
| 1.25 | 2 | 0.4% |
| Value | Count | Frequency (%) | |
| 27.74 | 5 | 1.0% | |
| 25.65 | 6 | 1.2% | |
| 21.89 | 14 | 2.8% | |
| 19.58 | 28 | 5.5% | |
| 18.1 | 127 | 25.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 20 |
| Missing (%) | 4.0% |
| Memory size | 4.0 KiB |
| 0 | |
|---|---|
| 1 | 34 |
| (Missing) | 20 |
| Value | Count | Frequency (%) | |
| 0 | 452 | 89.3% | |
| 1 | 34 | 6.7% | |
| (Missing) | 20 | 4.0% |
NOX
Real number (ℝ≥0)
| Distinct | 81 |
|---|---|
| Distinct (%) | 16.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5546950593 |
|---|---|
| Minimum | 0.385 |
| Maximum | 0.871 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 0.385 |
|---|---|
| 5-th percentile | 0.40925 |
| Q1 | 0.449 |
| median | 0.538 |
| Q3 | 0.624 |
| 95-th percentile | 0.74 |
| Maximum | 0.871 |
| Range | 0.486 |
| Interquartile range (IQR) | 0.175 |
Descriptive statistics
| Standard deviation | 0.1158776757 |
|---|---|
| Coefficient of variation (CV) | 0.2089033853 |
| Kurtosis | -0.06466713337 |
| Mean | 0.5546950593 |
| Median Absolute Deviation (MAD) | 0.0875 |
| Skewness | 0.7293079225 |
| Sum | 280.6757 |
| Variance | 0.01342763572 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0.538 | 23 | 4.5% | |
| 0.713 | 18 | 3.6% | |
| 0.437 | 17 | 3.4% | |
| 0.871 | 16 | 3.2% | |
| 0.624 | 15 | 3.0% | |
| 0.489 | 15 | 3.0% | |
| 0.693 | 14 | 2.8% | |
| 0.605 | 14 | 2.8% | |
| 0.74 | 13 | 2.6% | |
| 0.544 | 12 | 2.4% | |
| Other values (71) | 349 | 69.0% |
| Value | Count | Frequency (%) | |
| 0.385 | 1 | 0.2% | |
| 0.389 | 1 | 0.2% | |
| 0.392 | 2 | 0.4% | |
| 0.394 | 1 | 0.2% | |
| 0.398 | 2 | 0.4% |
| Value | Count | Frequency (%) | |
| 0.871 | 16 | 3.2% | |
| 0.77 | 8 | 1.6% | |
| 0.74 | 13 | 2.6% | |
| 0.718 | 6 | 1.2% | |
| 0.713 | 18 | 3.6% |
RM
Real number (ℝ≥0)
| Distinct | 446 |
|---|---|
| Distinct (%) | 88.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.284634387 |
|---|---|
| Minimum | 3.561 |
| Maximum | 8.78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 3.561 |
|---|---|
| 5-th percentile | 5.314 |
| Q1 | 5.8855 |
| median | 6.2085 |
| Q3 | 6.6235 |
| 95-th percentile | 7.5875 |
| Maximum | 8.78 |
| Range | 5.219 |
| Interquartile range (IQR) | 0.738 |
Descriptive statistics
| Standard deviation | 0.7026171434 |
|---|---|
| Coefficient of variation (CV) | 0.1117992074 |
| Kurtosis | 1.891500366 |
| Mean | 6.284634387 |
| Median Absolute Deviation (MAD) | 0.3455 |
| Skewness | 0.4036121333 |
| Sum | 3180.025 |
| Variance | 0.4936708502 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 5.713 | 3 | 0.6% | |
| 6.167 | 3 | 0.6% | |
| 6.127 | 3 | 0.6% | |
| 6.229 | 3 | 0.6% | |
| 6.405 | 3 | 0.6% | |
| 6.417 | 3 | 0.6% | |
| 6.782 | 2 | 0.4% | |
| 6.951 | 2 | 0.4% | |
| 6.63 | 2 | 0.4% | |
| 6.312 | 2 | 0.4% | |
| Other values (436) | 480 | 94.9% |
| Value | Count | Frequency (%) | |
| 3.561 | 1 | 0.2% | |
| 3.863 | 1 | 0.2% | |
| 4.138 | 2 | 0.4% | |
| 4.368 | 1 | 0.2% | |
| 4.519 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 8.78 | 1 | 0.2% | |
| 8.725 | 1 | 0.2% | |
| 8.704 | 1 | 0.2% | |
| 8.398 | 1 | 0.2% | |
| 8.375 | 1 | 0.2% |
| Distinct | 348 |
|---|---|
| Distinct (%) | 71.6% |
| Missing | 20 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68.51851852 |
|---|---|
| Minimum | 2.9 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 2.9 |
|---|---|
| 5-th percentile | 17.95 |
| Q1 | 45.175 |
| median | 76.8 |
| Q3 | 93.975 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 97.1 |
| Interquartile range (IQR) | 48.8 |
Descriptive statistics
| Standard deviation | 27.99951301 |
|---|---|
| Coefficient of variation (CV) | 0.4086415412 |
| Kurtosis | -0.9821403245 |
| Mean | 68.51851852 |
| Median Absolute Deviation (MAD) | 20.15 |
| Skewness | -0.5824700575 |
| Sum | 33300 |
| Variance | 783.9727285 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 100 | 42 | 8.3% | |
| 97.9 | 4 | 0.8% | |
| 87.9 | 4 | 0.8% | |
| 98.8 | 4 | 0.8% | |
| 96 | 4 | 0.8% | |
| 95.4 | 4 | 0.8% | |
| 76.5 | 3 | 0.6% | |
| 97 | 3 | 0.6% | |
| 96.2 | 3 | 0.6% | |
| 32.2 | 3 | 0.6% | |
| Other values (338) | 412 | 81.4% | |
| (Missing) | 20 | 4.0% |
| Value | Count | Frequency (%) | |
| 2.9 | 1 | 0.2% | |
| 6.2 | 1 | 0.2% | |
| 6.5 | 1 | 0.2% | |
| 6.6 | 2 | 0.4% | |
| 6.8 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 100 | 42 | 8.3% | |
| 99.3 | 1 | 0.2% | |
| 99.1 | 1 | 0.2% | |
| 98.9 | 3 | 0.6% | |
| 98.8 | 4 | 0.8% |
DIS
Real number (ℝ≥0)
| Distinct | 412 |
|---|---|
| Distinct (%) | 81.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.795042688 |
|---|---|
| Minimum | 1.1296 |
| Maximum | 12.1265 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 1.1296 |
|---|---|
| 5-th percentile | 1.461975 |
| Q1 | 2.100175 |
| median | 3.20745 |
| Q3 | 5.188425 |
| 95-th percentile | 7.8278 |
| Maximum | 12.1265 |
| Range | 10.9969 |
| Interquartile range (IQR) | 3.08825 |
Descriptive statistics
| Standard deviation | 2.105710127 |
|---|---|
| Coefficient of variation (CV) | 0.5548580872 |
| Kurtosis | 0.4879411222 |
| Mean | 3.795042688 |
| Median Absolute Deviation (MAD) | 1.29115 |
| Skewness | 1.011780579 |
| Sum | 1920.2916 |
| Variance | 4.434015137 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 3.4952 | 5 | 1.0% | |
| 5.7209 | 4 | 0.8% | |
| 5.2873 | 4 | 0.8% | |
| 6.8147 | 4 | 0.8% | |
| 5.4007 | 4 | 0.8% | |
| 6.3361 | 3 | 0.6% | |
| 3.9454 | 3 | 0.6% | |
| 6.498 | 3 | 0.6% | |
| 4.7211 | 3 | 0.6% | |
| 4.8122 | 3 | 0.6% | |
| Other values (402) | 470 | 92.9% |
| Value | Count | Frequency (%) | |
| 1.1296 | 1 | 0.2% | |
| 1.137 | 1 | 0.2% | |
| 1.1691 | 1 | 0.2% | |
| 1.1742 | 1 | 0.2% | |
| 1.1781 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 12.1265 | 1 | 0.2% | |
| 10.7103 | 2 | 0.4% | |
| 10.5857 | 2 | 0.4% | |
| 9.2229 | 1 | 0.2% | |
| 9.2203 | 2 | 0.4% |
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.549407115 |
|---|---|
| Minimum | 1 |
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 24 |
| 95-th percentile | 24 |
| Maximum | 24 |
| Range | 23 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 8.707259384 |
|---|---|
| Coefficient of variation (CV) | 0.9118115166 |
| Kurtosis | -0.8672319936 |
| Mean | 9.549407115 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.004814648 |
| Sum | 4832 |
| Variance | 75.81636598 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=9)
| Value | Count | Frequency (%) | |
| 24 | 132 | 26.1% | |
| 5 | 115 | 22.7% | |
| 4 | 110 | 21.7% | |
| 3 | 38 | 7.5% | |
| 6 | 26 | 5.1% | |
| 2 | 24 | 4.7% | |
| 8 | 24 | 4.7% | |
| 1 | 20 | 4.0% | |
| 7 | 17 | 3.4% |
| Value | Count | Frequency (%) | |
| 1 | 20 | 4.0% | |
| 2 | 24 | 4.7% | |
| 3 | 38 | 7.5% | |
| 4 | 110 | 21.7% | |
| 5 | 115 | 22.7% |
| Value | Count | Frequency (%) | |
| 24 | 132 | 26.1% | |
| 8 | 24 | 4.7% | |
| 7 | 17 | 3.4% | |
| 6 | 26 | 5.1% | |
| 5 | 115 | 22.7% |
| Distinct | 66 |
|---|---|
| Distinct (%) | 13.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 408.2371542 |
|---|---|
| Minimum | 187 |
| Maximum | 711 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 187 |
|---|---|
| 5-th percentile | 222 |
| Q1 | 279 |
| median | 330 |
| Q3 | 666 |
| 95-th percentile | 666 |
| Maximum | 711 |
| Range | 524 |
| Interquartile range (IQR) | 387 |
Descriptive statistics
| Standard deviation | 168.5371161 |
|---|---|
| Coefficient of variation (CV) | 0.4128411987 |
| Kurtosis | -1.142407992 |
| Mean | 408.2371542 |
| Median Absolute Deviation (MAD) | 73 |
| Skewness | 0.6699559418 |
| Sum | 206568 |
| Variance | 28404.75949 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 666 | 132 | 26.1% | |
| 307 | 40 | 7.9% | |
| 403 | 30 | 5.9% | |
| 437 | 15 | 3.0% | |
| 304 | 14 | 2.8% | |
| 264 | 12 | 2.4% | |
| 398 | 12 | 2.4% | |
| 384 | 11 | 2.2% | |
| 277 | 11 | 2.2% | |
| 224 | 10 | 2.0% | |
| Other values (56) | 219 | 43.3% |
| Value | Count | Frequency (%) | |
| 187 | 1 | 0.2% | |
| 188 | 7 | 1.4% | |
| 193 | 8 | 1.6% | |
| 198 | 1 | 0.2% | |
| 216 | 5 | 1.0% |
| Value | Count | Frequency (%) | |
| 711 | 5 | 1.0% | |
| 666 | 132 | 26.1% | |
| 469 | 1 | 0.2% | |
| 437 | 15 | 3.0% | |
| 432 | 9 | 1.8% |
PTRATIO
Real number (ℝ≥0)
| Distinct | 46 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.4555336 |
|---|---|
| Minimum | 12.6 |
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 12.6 |
|---|---|
| 5-th percentile | 14.7 |
| Q1 | 17.4 |
| median | 19.05 |
| Q3 | 20.2 |
| 95-th percentile | 21 |
| Maximum | 22 |
| Range | 9.4 |
| Interquartile range (IQR) | 2.8 |
Descriptive statistics
| Standard deviation | 2.164945524 |
|---|---|
| Coefficient of variation (CV) | 0.1173060379 |
| Kurtosis | -0.2850913833 |
| Mean | 18.4555336 |
| Median Absolute Deviation (MAD) | 1.15 |
| Skewness | -0.8023249269 |
| Sum | 9338.5 |
| Variance | 4.686989121 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=46)
| Value | Count | Frequency (%) | |
| 20.2 | 140 | 27.7% | |
| 14.7 | 34 | 6.7% | |
| 21 | 27 | 5.3% | |
| 17.8 | 23 | 4.5% | |
| 19.2 | 19 | 3.8% | |
| 17.4 | 18 | 3.6% | |
| 18.6 | 17 | 3.4% | |
| 19.1 | 17 | 3.4% | |
| 18.4 | 16 | 3.2% | |
| 16.6 | 16 | 3.2% | |
| Other values (36) | 179 | 35.4% |
| Value | Count | Frequency (%) | |
| 12.6 | 3 | 0.6% | |
| 13 | 12 | 2.4% | |
| 13.6 | 1 | 0.2% | |
| 14.4 | 1 | 0.2% | |
| 14.7 | 34 | 6.7% |
| Value | Count | Frequency (%) | |
| 22 | 2 | 0.4% | |
| 21.2 | 15 | 3.0% | |
| 21.1 | 1 | 0.2% | |
| 21 | 27 | 5.3% | |
| 20.9 | 11 | 2.2% |
B
Real number (ℝ≥0)
| Distinct | 357 |
|---|---|
| Distinct (%) | 70.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 356.6740316 |
|---|---|
| Minimum | 0.32 |
| Maximum | 396.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 0.32 |
|---|---|
| 5-th percentile | 84.59 |
| Q1 | 375.3775 |
| median | 391.44 |
| Q3 | 396.225 |
| 95-th percentile | 396.9 |
| Maximum | 396.9 |
| Range | 396.58 |
| Interquartile range (IQR) | 20.8475 |
Descriptive statistics
| Standard deviation | 91.29486438 |
|---|---|
| Coefficient of variation (CV) | 0.255961624 |
| Kurtosis | 7.226817549 |
| Mean | 356.6740316 |
| Median Absolute Deviation (MAD) | 5.46 |
| Skewness | -2.890373712 |
| Sum | 180477.06 |
| Variance | 8334.752263 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 396.9 | 121 | 23.9% | |
| 393.74 | 3 | 0.6% | |
| 395.24 | 3 | 0.6% | |
| 376.14 | 2 | 0.4% | |
| 394.72 | 2 | 0.4% | |
| 395.63 | 2 | 0.4% | |
| 392.8 | 2 | 0.4% | |
| 395.56 | 2 | 0.4% | |
| 390.94 | 2 | 0.4% | |
| 393.68 | 2 | 0.4% | |
| Other values (347) | 365 | 72.1% |
| Value | Count | Frequency (%) | |
| 0.32 | 1 | 0.2% | |
| 2.52 | 1 | 0.2% | |
| 2.6 | 1 | 0.2% | |
| 3.5 | 1 | 0.2% | |
| 3.65 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 396.9 | 121 | 23.9% | |
| 396.42 | 1 | 0.2% | |
| 396.33 | 1 | 0.2% | |
| 396.3 | 1 | 0.2% | |
| 396.28 | 1 | 0.2% |
| Distinct | 438 |
|---|---|
| Distinct (%) | 90.1% |
| Missing | 20 |
| Missing (%) | 4.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.7154321 |
|---|---|
| Minimum | 1.73 |
| Maximum | 37.97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 1.73 |
|---|---|
| 5-th percentile | 3.7075 |
| Q1 | 7.125 |
| median | 11.43 |
| Q3 | 16.955 |
| 95-th percentile | 27.15 |
| Maximum | 37.97 |
| Range | 36.24 |
| Interquartile range (IQR) | 9.83 |
Descriptive statistics
| Standard deviation | 7.155870816 |
|---|---|
| Coefficient of variation (CV) | 0.5627705579 |
| Kurtosis | 0.5186825176 |
| Mean | 12.7154321 |
| Median Absolute Deviation (MAD) | 4.795 |
| Skewness | 0.908891837 |
| Sum | 6179.7 |
| Variance | 51.20648713 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 7.79 | 3 | 0.6% | |
| 6.36 | 3 | 0.6% | |
| 8.05 | 3 | 0.6% | |
| 14.1 | 3 | 0.6% | |
| 18.13 | 3 | 0.6% | |
| 30.81 | 2 | 0.4% | |
| 4.59 | 2 | 0.4% | |
| 7.39 | 2 | 0.4% | |
| 12.67 | 2 | 0.4% | |
| 5.29 | 2 | 0.4% | |
| Other values (428) | 461 | 91.1% | |
| (Missing) | 20 | 4.0% |
| Value | Count | Frequency (%) | |
| 1.73 | 1 | 0.2% | |
| 1.92 | 1 | 0.2% | |
| 1.98 | 1 | 0.2% | |
| 2.47 | 1 | 0.2% | |
| 2.87 | 1 | 0.2% |
| Value | Count | Frequency (%) | |
| 37.97 | 1 | 0.2% | |
| 36.98 | 1 | 0.2% | |
| 34.77 | 1 | 0.2% | |
| 34.41 | 1 | 0.2% | |
| 34.37 | 1 | 0.2% |
Price
Real number (ℝ≥0)
| Distinct | 229 |
|---|---|
| Distinct (%) | 45.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.53280632 |
|---|---|
| Minimum | 5 |
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 10.2 |
| Q1 | 17.025 |
| median | 21.2 |
| Q3 | 25 |
| 95-th percentile | 43.4 |
| Maximum | 50 |
| Range | 45 |
| Interquartile range (IQR) | 7.975 |
Descriptive statistics
| Standard deviation | 9.197104087 |
|---|---|
| Coefficient of variation (CV) | 0.408165053 |
| Kurtosis | 1.495196944 |
| Mean | 22.53280632 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.108098408 |
| Sum | 11401.6 |
| Variance | 84.58672359 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 50 | 16 | 3.2% | |
| 25 | 8 | 1.6% | |
| 22 | 7 | 1.4% | |
| 21.7 | 7 | 1.4% | |
| 23.1 | 7 | 1.4% | |
| 19.4 | 6 | 1.2% | |
| 20.6 | 6 | 1.2% | |
| 13.8 | 5 | 1.0% | |
| 21.4 | 5 | 1.0% | |
| 20.1 | 5 | 1.0% | |
| Other values (219) | 434 | 85.8% |
| Value | Count | Frequency (%) | |
| 5 | 2 | 0.4% | |
| 5.6 | 1 | 0.2% | |
| 6.3 | 1 | 0.2% | |
| 7 | 2 | 0.4% | |
| 7.2 | 3 | 0.6% |
| Value | Count | Frequency (%) | |
| 50 | 16 | 3.2% | |
| 48.8 | 1 | 0.2% | |
| 48.5 | 1 | 0.2% | |
| 48.3 | 1 | 0.2% | |
| 46.7 | 1 | 0.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT | Price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.01 | 18.00 | 2.31 | 0.00 | 0.54 | 6.58 | 65.20 | 4.09 | 1 | 296 | 15.30 | 396.90 | 4.98 | 24.00 |
| 1 | 0.03 | 0.00 | 7.07 | 0.00 | 0.47 | 6.42 | 78.90 | 4.97 | 2 | 242 | 17.80 | 396.90 | 9.14 | 21.60 |
| 2 | 0.03 | 0.00 | 7.07 | 0.00 | 0.47 | 7.18 | 61.10 | 4.97 | 2 | 242 | 17.80 | 392.83 | 4.03 | 34.70 |
| 3 | 0.03 | 0.00 | 2.18 | 0.00 | 0.46 | 7.00 | 45.80 | 6.06 | 3 | 222 | 18.70 | 394.63 | 2.94 | 33.40 |
| 4 | 0.07 | 0.00 | 2.18 | 0.00 | 0.46 | 7.15 | 54.20 | 6.06 | 3 | 222 | 18.70 | 396.90 | NaN | 36.20 |
| 5 | 0.03 | 0.00 | 2.18 | 0.00 | 0.46 | 6.43 | 58.70 | 6.06 | 3 | 222 | 18.70 | 394.12 | 5.21 | 28.70 |
| 6 | 0.09 | 12.50 | 7.87 | NaN | 0.52 | 6.01 | 66.60 | 5.56 | 5 | 311 | 15.20 | 395.60 | 12.43 | 22.90 |
| 7 | 0.14 | 12.50 | 7.87 | 0.00 | 0.52 | 6.17 | 96.10 | 5.95 | 5 | 311 | 15.20 | 396.90 | 19.15 | 27.10 |
| 8 | 0.21 | 12.50 | 7.87 | 0.00 | 0.52 | 5.63 | 100.00 | 6.08 | 5 | 311 | 15.20 | 386.63 | 29.93 | 16.50 |
| 9 | 0.17 | 12.50 | 7.87 | NaN | 0.52 | 6.00 | 85.90 | 6.59 | 5 | 311 | 15.20 | 386.71 | 17.10 | 18.90 |
Last rows
| CRIM | ZN | INDUS | CHAS | NOX | RM | AGE | DIS | RAD | TAX | PTRATIO | B | LSTAT | Price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 496 | 0.29 | 0.00 | 9.69 | 0.00 | 0.58 | 5.39 | 72.90 | 2.80 | 6 | 391 | 19.20 | 396.90 | 21.14 | 19.70 |
| 497 | 0.27 | 0.00 | 9.69 | 0.00 | 0.58 | 5.79 | 70.60 | 2.89 | 6 | 391 | 19.20 | 396.90 | 14.10 | 18.30 |
| 498 | 0.24 | 0.00 | 9.69 | 0.00 | 0.58 | 6.02 | 65.30 | 2.41 | 6 | 391 | 19.20 | 396.90 | 12.92 | 21.20 |
| 499 | 0.18 | 0.00 | 9.69 | 0.00 | 0.58 | 5.57 | 73.50 | 2.40 | 6 | 391 | 19.20 | 395.77 | 15.10 | 17.50 |
| 500 | 0.22 | 0.00 | 9.69 | 0.00 | 0.58 | 6.03 | 79.70 | 2.50 | 6 | 391 | 19.20 | 396.90 | 14.33 | 16.80 |
| 501 | 0.06 | 0.00 | 11.93 | 0.00 | 0.57 | 6.59 | 69.10 | 2.48 | 1 | 273 | 21.00 | 391.99 | NaN | 22.40 |
| 502 | 0.05 | 0.00 | 11.93 | 0.00 | 0.57 | 6.12 | 76.70 | 2.29 | 1 | 273 | 21.00 | 396.90 | 9.08 | 20.60 |
| 503 | 0.06 | 0.00 | 11.93 | 0.00 | 0.57 | 6.98 | 91.00 | 2.17 | 1 | 273 | 21.00 | 396.90 | 5.64 | 23.90 |
| 504 | 0.11 | 0.00 | 11.93 | 0.00 | 0.57 | 6.79 | 89.30 | 2.39 | 1 | 273 | 21.00 | 393.45 | 6.48 | 22.00 |
| 505 | 0.05 | 0.00 | 11.93 | 0.00 | 0.57 | 6.03 | NaN | 2.50 | 1 | 273 | 21.00 | 396.90 | 7.88 | 11.90 |